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Abstract 

Numerous studies have examined tlie genetic diversity and genetic structure of invading species, witli contrasting results 
concerning the relative roles of genetic diversity and phenotypic plasticity in the success of introduced populations. 
Increasing evidence shows that asexual lineages of aphids are able to occupy a wide geographical and ecological range of 
habitats despite low genetic diversity. The anholocyclic aphid Melanaphis sacchari is a pest of sugarcane and sorghum 
which originated in the old world, was introduced into the Americas, and is now distributed worldwide. Our purpose was to 
assess the genetic diversity and structuring of populations of this species according to host and locality. We used 10 
microsatellite markers to genotype 1333 individuals (57 samples, 42 localities, 15 countries) collected mainly on sugarcane 
or sorghum. Five multilocus lineages (MLL) were defined, grouping multilocus genotypes (MLG) differing by only a few 
mutations or scoring errors. Analysis of a 658 bp sequence of mitochondrial COI gene on 96 individuals revealed five 
haplotypes, with a mean divergence of only 0.19 %. The distribution of IVILL appeared to be strongly influenced by 
geography but not by host plant. Each of the five MLL grouped individuals from (A) Africa, (B) Australia, (C) South America, 
the Caribbean and the Indian Ocean including East Africa, (D) USA, and (E) China. The MLL A and C, with a wide geographic 
distribution, matched the definition of superclone. Among aphids, M. sacchari has one of the lowest known rates of genetic 
diversity for such a wide geographical distribution. 
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Introduction 

Range expansion of exotic species can result from either 
evolutionary adaptation or generalism and plasticity often 
associated with a change in niche [1]. Genetic diversity is 
required for evolutionary adaptation, but a reduction in genetic 
diversity in invasive populations compared to populations in 
their native range is expected and often observed [2-5]. 
Organisms with clonal reproduction may exhibit reduced 
genetic diversity within populations, as better-adapted clonal 
genotypes expand and dominate available resources [6]. In 
aphids, the concept of "superclones" emerged [7] when a few 
asexual genotypes of the same species were able to colonize a 
wide geographical or ecological range of habitats [8—11]. The 
capacity of these populations to adapt to different conditions 
could be the result of a preadaptation capacity for phenotypic 
plasticity rather than local selection acting on genetic diversity 
[12,13]. What is more, this capacity may be enhanced by their 
high rate of reproduction and population expansion [14]. For 
these reasons, clonal aphids are good models to assess the ability 
of asexual lineages to show rapid and widespread adaptive 
changes to ecological conditions [15]. 



The old world genus Melanaphis van der Groot 1917 comprises 
around 20 species mainly associated with Poaceae, most of which 
originate from East Asia [16]. The sugarcane aphid Melanaphis 
sacchari (Zehtner, 1897) (Homoptera, Aphididae), which is 
considered to be mainly anholocyclic, is present in America, 
Australia, Asia and Africa. M. sacchari is known to be invasive in 
continental US [17,18] and in Central and South America [16]. 
The host range of this species is restricted to Poaceae [2,19]. 
Blackman et al. [20] hypothesised that Melanaphis individuals 
originating from sorghum or sugarcane were distinct taxa, referred 
to as M. sorghi and M. sacchari respectively, even though their 
host plant preference was not absolute. In their catalogue, 
Remaudiere and Remaudiere [21] considered M. sorghi to be a 
synonym for M. sacchari, but both forms were still listed as 
separate taxa by Blackman and Eastop [22], and it is still not clear 
whether M. sacchari constitutes a single species or a complex of 
sibling taxa. 

In any case, M. sacchari is a major pest of sorghum and 
sugarcane. On sugarcane, it is considered to be the most common 
and most efficient vector of the Sugarcane yellow leaf virus 
(ScYLV), which causes yellow leaf disease [23,24], a disease of 
worldwide economic importance [25-27]. The aphid is also a 
major pest of sorghum, causing direct damage (sap feeding) and 
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indirect damage (sooty mould) [19]. Varietal resistance against M. 
sacchari is one of the main control tactics suggested both for 
sugarcane [28,29] and sorghum [19,30]. Most plant resistance to 
aphids is .specific to a single aphid species or to a few biotypes 
within a species [31] and it has been demonstrated that variability 
exists among tional lineages of aphids in their response to resistant 
cultivars [32,33]. Therefore, characterisation of the genetic 
diversity of aphids is critical for breeding durable and efficient 
resistance, which has to account for the worldwide diversity of 
these pests and the potential emergence of new invasive biotypes. 

Based on a worldwide sample of aphids covering its area of 
distribution (home range as well as invasive range) and using 
microsateUite markers and sequencing of a fragment of the 
mitochondrial cytochrome c oxidase I gene (COI), the purpose of 
this study was to evaluate 1) the mode of reproduction of M. 
sacchari; 2) mokx ular (■\ idence for the existence of sibling species; 
and 3) its genetic diversity and structuring according to host and 
locality. 

Materials and Methods 

Insect samples 

Here, an 'individual' refers to one individual aphid and a 
'sample' refers a several individuals collected from the same host 
plant species in a given locality and date. The complete set of 
individuals (Table SI) comprised ,57 samples from 42 localities in a 
total of 15 countries or provinces, and from five host plants: 
sugarcane, pearl millet, and three wild or cultivated sorghum 
species {Sorghum bicolor, S. halepense, S. verticilliflorum). The 
three sorghum species were considered as a single host plant, 
hereafter named 'sorghum'. 

Aphids were collected from wild or cultivated plants and placed 
in 70% ethanol in Eppendorf tubes, kept frozen at — 80°C until 
they were processed. Only a few aphids were collected on each 
plant sampled to avoid collecting several individuals from the same 
colony. 

Sampling was carried out from 2002 to 2009 by our team in 
Reunion Island and by colleagues in the other parts of the world 
(see acknowledgements). Geographic coordinates of sampling 
localities are provided in Table SI. No specific permissions were 
required for sampling aphids in these locations. The field studies 
did not involve endangered or protected species. 

DNA extraction, genotyping and sequencing 

DNA extraction. DNA was extracted using the "salting-out" 

protocol of Sunnucks and Hales [34]. Briefly, it consists in 
extracting DNA from whole aphids by crushing them in a TNES/ 
Proteinase K bulfer and precipitating DNA in ethanol. This 
method is simple and fast, and provided sufficient DNA for 
phylogeny and microsateUite PGR analyses. 

Genotyping. According to their polymorphism, ten micro- 
satellite loci (Tab. 1) were selected among the 14 previously 
developed by our team for M. siicchari [35]. PGR reactions were 
performed with labelled primers and multiplexed into two mixes 
(Type-it, standard procedure, Qiagen), and the following thermo- 
cycling protocol was used: denaturation at 95°G for 15 min, 25 
denaturation cycles for 30 s at 94°G, a 1-min 30 s annealing step 
at 54°C, and a 30-s elongation step at 72°C. We used an ABI 
prism 3110 for genotyping after addition of an internal size 
standard for each sample (GeneScan LIZ 500, Applied Biosys- 
tems). Alleles were identified at each locus by comparison with the 
size standard using GeneMapper version 2.5 software (Applied 
Biosystems). 



Sequencing. A total of 91 aphids were chosen among the 
worldwide sample to represent different combinations of region 
and host plant. COI fragments were amplified using the LCO1490 
and HC02198 primers designed by Folmer et al. [36]. PGR was 
carried out using the protocol of Kim and Lee [37]. PGR products 
were purified and sequenced by a subcontractor (Cogenics), and a 
consensus sequence of 658 pb was chosen for later analyses. 

Data analysis 

Clonal diversity analysis. Micro-Checker software [38] was 
run on the whole population. No evidence was found for the 
presence of null alleles. Any single combination of alleles was 
retrieved from genotyping data and arranged as unique multilocus 
genotypes (MLGs). Given the clonal reproduction of M. sacchari, 
we assumed that the different occurrences of the same MLG in a 
sample were the result of local clonal reproduction. We therefore 
retained a single representative of each MLG in each of the 57 
samples for genetic and diversity analysis. 

Using GENGLONE software [39], we computed a matrix of 
pairwise genetic distance between MLGs computed as the 
number of allelic diffcrc-nccs between MLGs [40]. Examination 
of the distribution of these distances enabled us to define a 
threshold below which MLGs were considered to belong to the 
same multilocus lineage (MLL), i.e. genotypes which differed 
slightly due to mutation or scoring errors according to Arnaud- 
Haond et al. [40] . The same matrix of pairwise distances was 
also used to construct a minimum spanning network using 
HAPSTAR software [41]. On the set of identical loci within 
each MLL, we computed p^ex, the probability that the repeated 
MLGs originated from distinct sexual reproductive events. A 
Pscx value lower than 0.01 supported the hypothesis that MLGs 
originated from the same MLL [40]. To describe clonal 
diversity, we computed the clonal richness index as R = (G- 
1)/(N-1), where G is the number of genotypes detected (either 
MLGs for Rmlc; or MLL for Rmll)> and N is the number of 
samples [42]. 

Phylogenetic analysis. Sequence alignments of the GOI 
gene were performed using Geneious software version 5.6.6 [43]. 
Five sequences from M. sacchari individuals collected in India [44] 
were retrieved from GenBank and added to our data. Four 

sequences from three species of the Melanaphis genus were also 
retrieved from GenBank and used as outgroups: M. donacis 
(referenced HQ443314), M. bambusae (referenced EU701747 and 
EU701746) and M. japonica (referenced GU457792). Maximum 
Likelihood inference performed with MEGA6 [45] was used to 
choose the most reliable evolutionary model of base substitution to 
infer the evolutionary history. Based on the AICc criterion, the 
best model proved to be the General Time Reversible model with 
gamma distribution of evolutionary rates among sites (GTR+G) 
[46]. The GTR-hG model was then used witii MEGA6 to 
reconstruct the subsequent phylogenetic tre(^ through the Maxi- 
mum Likelihood method, with 10,000 bootstrap replicates for 
branch support. 

Population genetic analysis. We used GENEPOP [47] to 
compute population genetics parameters for each of the MLLs 
delimited by GENGLONE. We tested departures from Hardy- 
Weinberg equilibrium and heterozygote deficit and excess, and 
calculated population fixation index values (F;,). Genetic differen- 
tiation between MLLs was tested with a G test and pairwise Fgx 
were computed. 
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Results 

Genetic and clonal diversity 

We genotyped a total of 1333 aphids using the ten microsatellite 
markers. When we retained a single representative of each MLG 
in each of the 57 samples, this yielded a dataset containing 98 
individuals. 

Global genetic diversity was low, with 36 MLGs found 
(Table 1). Global clonal richness was also low, with a Rmlc; ~ 
0.361. The distribution of the pairwise number of different alleles 
between MLG appeared multimodal, with a first minimum 
located at a distance of five alleles (Figure SI). Grouping MLGs 
which differed by one to four alleles defined five groups. 
Calculation of p^.^ on the set of identical loci within each of these 
five groups yielded values <0.01, confirming that the MLGs 
within each group were unlikely to have derived from distinct 
reproductive events. We therefore considered that the five groups 
defined five multHocus lineages (MLLs) which grouped slightiy 
distinct MLGs resulting from step mutations or scoring errors 
(Table 1, Figure 1). Considering the five MLLs, clonal richness 
was very low, as shown by the Rmll = 0.041. 

Phylogenetic relationship within samples 

Within our 91 M. sacchari COI sequences, only three distinct 
haplotypes were observed (Figure 2). One haplotype was 
observed in individuals belonging to MLL-A, MLL-B or 
MLL-E (Figure 1, Figure 2). The second haplotype was only 
observed in individuals belonging to MLL-C. The third 
haplotype was observed in individuals belonging to MLL-D. 
No association of haplotypes with the host plant was observed 
(Figure 2). These three haplotypes differed from the two 
available in GenBank from five Indian samples, giving a total 
of five haplotypes and five nucleotide substitutions among 96 M. 
sacchari individuals. The phylogenetic tree built from a 658 bp 



fragment of the COI gene clearly separated the four 
Melanaphis species with >80% bootstrap support (Figure S2). 
But within the M. sacchari sequences, the presence of distinct 
taxa was not supported by bootstrap analysis at the 80% 
threshold. Intraspecific genetic divergence in M. sacchari was 
low, with a mean pairwise divergence of 0.19% (range 
0.000.61%). When the five M. sacchari sequences retrieved 
from GenBank were excluded, the mean divergence was 0.17% 
(range 0.000.30%). The mean divergence of M. sacchari 
sequences with the closest taxa, M. japonica, was 1.06% (range 
0.92% 1.39%). 

Standard population genetics 

Plotting the results of the factorial correspondence analysis with 
GENETIX confirmed the grouping of the 36 MLGs in five MLLs 
(Figure 3). Factor 1 distinguished MLL-A and MLL-E, and a 
group formed by the three MLL-B, MLL-C and MLL-D. Factor 2 
distinguished MLL-E from other MLLs. Factor 3 distinguished 
between MLL-B, MLL-C and MLL-D. 

Genetic differentiation between the five MLLs was strong, with 
a highly significant Fj, ranging from 0.262 to 0.694 (Table S2). 

The five populations comprised by each distinct MLL differed 
significantiy from Hardy-Weinberg equilibrium, and showed a 
clear signature of asexual reproduction with a significant 
heterozygote excess and negative Fjg values (Table S3). 

Geographical and host distribution of MLLs 

Distribution of the MLLs revealed strong geographical struc- 
turing (Figure 4). MLL-A was observed in Africa, MLL-B was 
restricted to Australia, MLL-C exhibited the widest distribution 
area (South America, the Caribbean, the Indian Ocean and East 
Africa), MLL-D was observed in the USA, and MLL-E was only 
observed in China. Kenya was the only country where two MLLs 
were observed simultaneously: one sample (Kenl) contained a mix 




Figure 1. Minimum spanning network of Melanaphis 5ac(r/iaA7 microsatellite distances computed as the number of allele differences 
between MLGs. Each node represents one step in the network, i.e. a distance of one allele. The numbers in the circles represent MLGs 
according to Table 1. Coloured backgrounds represent the Multi Locus Lineages (MLLs). MLGs in the same dashed line box share the same COI 
haplotype. 

doi:1 0.1 371/journal.pone.01 06067.g001 
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of MLL-A and MLL-B, one sample (Ken5) contained MLL-A 
alone, and three samples (Ken2, Ken3, Ken4) contained MLL-B 
alone. 

No host plant structuring was observed: in all the countries 
where both sorghum and sugarcane samples were collected, each 
MLL was found on both host plants (Table 2). 

Discussion 

Molecular analysis revealed a very low genetic diversity among 
57 samples collected in 15 countries on two main hosts, with 36 
MLGs structured in five MLLs. The distribution of MLLs was 
strongly structured by geography but not by the host plant 
(sorghum vs. sugarcane). 

Sequencing the COI 'barcoding' region, a typical locus used for 
species discrimination and phylogeny, particularly in aphids 
[48,49], did not enable the detection of cryptic species in our 
samples. Specifically, we obsei-ved no molecular evidence for a 
clear separation into two species, M. sacchari and M. sorghi. We 
found sequence variations peaking at 0.61%, with a mean value of 
0.19%, both of which are within the range of intraspecific 
divergence observed in the Aphididae family by Footit et al. [50] 
or Lee et al. [51]. 

Reproduction 

Population genetic parameters were consistent with populations 
which only reproduce by apomictic parthenogenesis, as previously 
described by Blackman and Eastop [22]. Each population 
significantiy differed from Hardy-Weinberg equilibrium, with a 
high heterozygote excess. These features are a common conse- 
quence of populations which have reproduced clonally for a long 
time. In a global study of genetic diversity on Aphis gossypii, 
Carletto et al. [52] obtained similar results to ours as they 
observed low genetic diversity, with the predominance of a few 
clones at the worldwide scale reproducing only by apomictic 
parthenogenesis. But later, evidence for sexual reproduction of ^. 
gossypii was found in Iran [53]. High genetic diversity and 
evidence for sexual reproduction was also observed in A. gossypii 
alate spring migrants in France [54] . Similarly, in the Brachycau- 
dus helichrysi (Kaltenbach) sibling species H2, a sexually 
reproducing population was identified in India, despite almost 
exclusively clonal reproduction at the worldwide scale [11]. This 
shows that sexual admixture can still exist in a local population 
even in species which are highly clonal at the worldwide scale. A 
holocycle has been observed in M. sacchari in Asia [16] and this 
suggests that higher genetic diversity may exist in some parts of its 
geographic distribution area even if our sampling did not allow us 
to observe it. 

Geographic genetic structure 

Microsatellite analyses showed that population structuring at 
the worldwide scale was only influenced by geography, delimiting 
five MLLs corresponding to five geographic zones: Africa, China, 
Australia, USA, and South America - Indian Ocean (including 
Kenya) and the Caribbean. Variation within each of the four 
biggest zones (excluding China where only one sample was 
analysed) was low, with MLGs within a zone differing by a few 
step mutations, which we suggest is due to one or few separate 
introductions. These results suggest that each of the five zones was 
colonized separately following the introduction of one or a few 
clones from the region of origin of M. sacchari, which the present 
study did not allow us to identify. At least two of the five MLLs 
covered a very wide geographic area and matched the pattern of a 
single asexual genotype with a high capacity for dispersal which 
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Figure 2. COI haplotype network (top), in wKiich Melanaphis sacchari COI sequences originating from the present study are 
numbered from 1 to 3. M. sacchari GenBank COI sequences from India [44] are numbered 4 (JX051388, JX05189, JX051390) and 5 (HQ1 12185, 
JX051402). Mj = Melanaphis japonica COI sequence from GenBank (GU457792). Distribution as a function of host plant (middle): sorghum (blue) vs. 
sugarcane (red). Distribution as a function of MLL (bottom): A (green), B (blue), C (yellow), D (violin), E (red). 
doi:1 0.1 371 /journal.pone.01 06067.g002 



would have spread across a large area: MLL-A was observed in 
West and East Africa, and MLL-C was observed in South 
America, the Indian Ocean, East Africa, and the Caribbean. Both 
MLLs match the definition of "superclone" [7], characterised by 
geographically and ecologically widespread distribution, which has 
already been documented in several aphid species [9,10, 1 1,55,56]. 

The low rate of genetic diversity observed in the whole 
geographic area covered by our study, and the lack of published 



data about the dates of introduction of M. sacchari in the countries 
sampled, meant we were not able to reconstruct the invasion 
routes of this species. The only exception was continental USA. M. 
sacchari was first described in Hawaii in the late 19* century 
[57,58], and was first recorded in continental USA at the end of 
the 1970s in Florida [59,60] and in 2001 in Louisiana [18]. In our 
study, almost all individuals sampled in Louisiana and Hawaii 
belonged to the same MLG, Ms9, and shared the same COI 
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Figure 3. Factorial correspondence analysis of microsateilite data with GENETIX. Each symbol represents one of the 36 MLGs. Colours and 
letters refer to Multilocus Lineage (iVILL) assignment with GENCLONE. 
doi:1 0.1 371/journal.pone.01 06067.g003 



haplotype, neither of which were observed in other regions. This 
strongly suggests that M. sacchari was introduced in continental 
USA from Hawaii. This finding is noteable, as one would expect 
an introduction into continental USA from either South America 
or the Caribbean, a shorter invasion route. However, Mondor 
et al. [61] emphasized that the relationship between the coloni- 
zation of the Hawaiian Islands by an aphid species and its presence 
in continental USA was due to the high rate of commercial 
exchanges between the two. Here we provide an example of 
reverse colonization from Hawaii to continental USA. In Kenya, 



MLL-C was observed in three samples from the coastal region of 
Kenya but was not found in a sample collected inland. This 
underlines the fact that, even though M. sacchari has been 
recorded in almost all areas where sugarcane is cultivated, the 
possibility for the expansion of some genotypes should be taken 
seriously, mainly its unknown potential impact on the epidemiol- 
ogy of the viral diseases it transmits. 




Figure 4. Relative geographical within-state distribution of Multilocus lineages (MLL). The size of circle is not proportional to the size of 
the sample. Aus = Australia, Bra = Brazil, Col = Columbia, Ecu = Ecuador, Gua = Guadeloupe, Haw = Hawaii, Lou = Louisiana, Mar = Martinique, 
Mau = Mauritius, Run = Reunion Island, Ben = Benin, Cam = Cameroon, Nig = Niger, Chi = China, Ken = Kenya. 
doi:1 0.1 371/journal.pone.01 06067.g004 
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Figure SI Distribution of the pairwise number of 
di£Perent alleles between MLGs. 

(PDF) 

Figure S2 Molecular Phylogenetic analysis by Maxi- 
mum Likelihood method with bootstrap support (10,000 
replicates) using 658 bp cytochrome c oxidase subunit I 
sequences from 100 Melanaphis spp. individuals. 

(PDF) 

Table SI Voucher number, sampling information, 
GenBank accession and SSR genotyping of individual 
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Table S2 Genetic differentiation between MLLs: pair- 
wise Fgx and significance of the G test computed with 
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